Comparing Apples to Apple: The Effects of Stemmers on Topic Models

Authors

  • Alexandra Schofield
  • David M. Mimno
Abstract

Rule-based stemmers such as the Porter stemmer are frequently used to preprocess English corpora for topic modeling. In this work, we train and evaluate topic models on a variety of corpora using several different stemming algorithms. We examine several different quantitative measures of the resulting models, including likelihood, coherence, model stability, and entropy. Despite their frequent use in topic modeling, we find that stemmers produce no meaningful improvement in likelihood and coherence and in fact can degrade topic stability.
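
To make the idea of conflation concrete, here is a minimal sketch of what a rule-based stemmer does to a vocabulary. It assumes a hypothetical three-suffix stripper named `toy_stem`, which is far simpler than the Porter stemmer and is not taken from the paper. The unigram entropy computed here is only an illustration of how merging word forms concentrates counts; it is not necessarily the entropy measure the authors use.

```python
import math
from collections import Counter


def toy_stem(token: str) -> str:
    """Hypothetical suffix stripper for illustration; NOT the Porter algorithm."""
    for suffix in ("ing", "ed", "s"):
        # Strip the first matching suffix, keeping at least 3 characters of stem.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def unigram_entropy(tokens: list[str]) -> float:
    """Shannon entropy (in bits) of the empirical token distribution."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# A made-up token list, chosen only to show conflation.
tokens = ["apple", "apples", "model", "models", "modeling", "topic", "topics"]
stemmed = [toy_stem(t) for t in tokens]

print(sorted(set(tokens)))    # 7 distinct surface forms
print(sorted(set(stemmed)))   # 3 distinct stems: apple, model, topic
print(unigram_entropy(tokens), unigram_entropy(stemmed))
```

In this toy example "apples" and "apple" collapse to one type, so the stemmed list has fewer distinct types and lower unigram entropy. Whether that kind of merging helps or hurts a topic model is the empirical question the abstract addresses.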

Similar resources

Acoustic detection of apple mealiness based on support vector machine

Mealiness degrades the quality of apples and plays an important role in the fruit market. Reliable, rapid sensing techniques for nondestructive measurement and sorting of fruit are therefore necessary. In this study, the potential of acoustic signals from apples rolling on an inclined plate as a new technique for nondestructive detection of Red Delicious apple mealiness was investigate...

Quick Estimation of Apple (Red Delicious and Golden Delicious) Leaf Area and Chlorophyll Content

The evaluation of leaf area and leaf nutritional value is important for crop growth modeling and estimations of its performance. The purpose of this study was to use image processing techniques to develop an economical method to ease the assessment of nutrient status and leaf area (LA) of plants and to compare the outcomes of this method with linear models. Leaf area and leaf chloroph...

Effects of Natural Mucilage as an Edible Coating on Quality Improvement of Freshly-cut Apples

Background and Objectives: Production and consumption of freshly-cut fruits have increased in recent decades. One of the major problems in the storage of freshly-cut fruits, color change, results from the oxidation of phenolic compounds by polyphenol oxidases. Various treatments such as coating and refrigeration are used to improve quality and shelf-life of the fresh-cut fruit...

Influence of 1-aminoethoxyvinylglycine hydrochloride and α-naphthalene acetic acid on fruit retention, quality, evolved ethylene, and respiration in apples

Effects of 1-aminoethoxyvinylglycine hydrochloride (AVG or Aviglycine HCl or ReTain) and α-naphthalene acetic acid (NAA) on fruit retention, fruit quality, evolved ethylene, and respiration in ‘Rome Beauty’ and three ‘Delicious’ apple cultivars (Malus domestica Borkh.) were studied. The experimental trees were treated with either AVG, applied at 120 g a.i. per 935 L ha⁻¹ or NAA, applied at t...

Effects of Harvest Date on Apple Fruit Quality at Harvesting and after Cold Storage

Different harvest dates for apple fruit (Malus domestica Borkh. cv. Fuji) were studied to determine physicochemical changes during storage. Fuji apples were harvested at five different times from 9 September to 23 October and stored at 0±0.5 °C and 95% relative humidity for 120 days. To determine the best harvest date for maximum quality and storability, physical and chemical paramet...

Journal title:
  • TACL

Volume 4, Issue

Pages -

Publication date: 2016